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CIPRIAN DEMETER, MALABIKA PRAMANIK, AND CHRISTOPH THIELE 

Abstract. We prove bounds for multilinear operators on R'' given by multipliers which 
are singular along a k dimensional subspace. The new case of interest is when the rank 
k/d is not an integer. Connections with the concept of true complexity from Additive 
Combinatorics are also investigated. 
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1. Introduction 

Let n > 3 and d > 1. We consider a multiplier M{^^-^\ . . . , ^(")) on the vector space 

n 

r := {f := . . . ,&^) e (R'^)" : Y.^'^ = 0}. 

1=1 

This gives rise to the multi-linear operator on n — 1 functions on R*^ 

We will prove the following 
Theorem 1.1. Let V be a generic linear subspace ofT of dimension k >0, and assume 

0<k/d<n/2. (1) 
Assume the multiplier M : F ^ R satisfies 

|9"M(OI<dist(e,r')-'"', (2) 



Key words and phrases. Multilinear singular integral operators; fractional rank. 
AMS subject classification: Primary 42B20. 

1 



2 



CIPRIAN DEMETER, MALABIKA PRAMANIK, AND CHRISTOPH THIELE 



for all partial derivatives up to some finite order. Then 

T: X ■■■LP"-' ^ LP'", 
whenever 2 < pj < oo for each 1 <i <n and 

11 

_ + ... + _ = !, 3 

Pi Pn 

where p'^ is the conjugate exponent to Pn- 

The generic character of F' in Theorem 1.1 is understood with respect to the Lebesgue 
measure (for example). In fact, we will need V to satisfy some precise non-degeneracy 
conditions, and they are generically satisfied. To give the reader a grasp on what these 
conditions amount to, we will describe them now in the case d = 2. The case d > 3 
involves very similar considerations and will be described in Section 4.3. 

There will be two sets of non-degeneracy requirements. The first one is that T' can be 
parameterized by any k of the canonical variables. We will work a lot with the following 
parametrization. For 1 < i < n, let Gj : R'' be the linear functions such that 

,6) = (6,... 

G2{il, ■ ■ ■ ,^k) = iCd+l, ■ ■ ■ , C2d) 



Glk/d] + l{^l, ■ ■ ■ ,Ck) = (y^dlk/dl + 1, ■ ■ ■ ,^k,---) 

where the last entries of G[k/d]+i (the ones after the entry) and the entries of the 
remaining Gi {[k/d\ + 1 < i < n) are uniquely determined by the requirement that the 
function 

GiX ...xGn 

maps into F'. 

Let m be the smallest integer that is greater than or equal to k/d. We will use the 

notation (rf\ . . . ,^1*^) G R^ Let ii,... ,ik e {1, . . . , n} be pairwise distinct 

indices. Consider the following system of {k vector, or equivalently 2k scalar) linear 
equations in 2k variables . . . , E R, and coefficients Vj E R^. 

rG,,(f«)-G',(rp)) = ^i 

G,,(f«)=^2 

<Gai^''>)=v^ (4) 

When d — 2, Theorem 1.1 has the following precise formulation. 

Theorem 1.2. Let d ~ 2 and let T' be a linear subspace of F of dimension k > 0. 

Assume T' is the graph over every k of the canonical variables. Moreover, assume that 
the system (4) has a unique solution (for each choice of Vi ) for each pairwise distinct 
ii, . . . , ife e {1, . . . , n}. // the remaining hypotheses from Theorem 1.1 are satisfied, then 
its conclusion will hold. 



MULTILINEAR SINGULAR OPERATORS WITH FRACTIONAL RANK 



3 



It is not hard to see that the requirements in Theorem 1.2 are satisfied for a generic F'. 
The assumption on the compatibihty of the system (4) is one of many (similar in style) 
that work for our approach, and are guaranteed to hold generically. Various other possible 
alternative assumptions will become apparent from our later analysis. The minimal non- 
degeneracy conditions that are needed for Theorem 1.1 (or even for our approach) are 
probably very hard to find, and beyond the goal of this paper. We point out however that 
if r' is degenerate in the sense that it fails to be the graph over some particular choice of 
k canonical variables, the analysis of the operator T complicates to a significant extent. 
This has been observed and investigated in [1], in the case d — 2. 

Theorem 1.1 was proved in [8] in the case d — 1, so our theorem is only new in the 
case d > 2. There, the theorem is proved under just the first non-degeneracy assumption, 
that r' is the graph over any k of the canonical variables. The result in [8] is proved for 
a larger class of indices Pi. To simplify our exposition, we choose to prove our theorem in 
the locally case Pi > 2. 

A key parameter for our analysis is m, introduced earlier. We will refer to k/d as 
the rank of the operator. When this rank is an integer (and thus equal to m), or more 
generally, when m < n/2, Theorem 1.1 will follow by a pretty straightforward adaptation 
of the argument in [8] to the d— dimensional setting. The novelty here is that k/d can be 
fractional and sufficiently close to n/2 to allow for the "bad" case m > n/2. A first new 
case of interest where our theorem is applicable is when d = 2, k = 3, n = 4. 

A simplified version of our approach also gives an alternative proof to the result in [8] 
(the d = 1 case), at least in the case Pi > 2 (see Section 4.4). Our proof and that from [8] 
share much of the analytic part of the argument. The proof in [8] however is structured 
around an induction on k that is not available in the fractional rank case. We eliminate 
the induction from the argument, and treat all /c in a similar fashion. This new type of 
approach will involve a rather delicate combinatorics. 

Theorem 1.1 also has a kernel formulation: 

Theorem 1.3. Let K : R,'^('^-i)-'= 'R, he a C alder on- Zygmund kernel, and let k : 

generic linear forms. Let also pi be as in Theorem 1.1, and as- 
sume < k/d < n/2. For Schwartz functions Fi, . . . ,Fn : R'^ — > R define the multilinear 
operator 

T(Fi,... ,F,_i)(f) := / \\F,{x + k{t))K{t)dt, f G R'^. 

jRd(n-l)-fc 

Then T extends to a hounded operator 

T : X •••L^'"-i ^ LP'^. 

In the case A; > 0, the operators we investigate will typically have some modulation 
invariance. As a consequence, proving their boundedness will involve time-frequency 
analysis similar to the one in the proof of the Bilinear Hilbert Transform ([6], [7]). 

The assumption k/d < n/2 is crucial to our analysis. It can be shown in particular to 
guarantee that T has no symmetries of higher order (i.e quadratic symmetries). On the 
other hand, even in the one dimensional case, the quadratic symmetries may be^ present 



These symmetries are however not guaranteed to exist. In [3] there are examples in the case d = 1, 
k = n/2 which do not have any quadratic or higher order symmetries 
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when k > n/2. Perhaps the most famous example with k/d — n/2 is the Trihnear Hilbert 
Transform (n = 4, A; = 2, d = 1) 

T{FuF2,Fs){x)= / F,{x + t)F2{x-t)F,{x + 2t)-. (5) 

Another important example with k/d = n/2 is the degenerate two dimensional Bilinear 
Hilbert Transform {n — 3, k — 3, d — 2) 

T{FuF2){x,y)= / F^{x + t,y)F2{x,y + t)-, (6) 

where the quadratic symmetries are not singled out, but rather part of an infinite group 
of symmetries generated by degeneracies (Sec [1] for details). The current techniques do 
not seem enough to address the case k/d = n/2, where it is likely that some form of 
" Quadratic Fourier Analysis" will play a role. A quick single scale heuristics is provided 
in Section 4.1. However, Theorem 1.1 above shows that they can address the case of 
a rank k/d arbitrarily close to (and smaller than) n/2. This paper grew partly as an 
attempt to get more light on these issues. 

A second motivation for the considerations in this paper comes from connections with 
Additive Combinatorics, in particular with the issue of true complexity of a system of 
linear equations. Our analysis makes the point that nondegenerate systems characterized 
hy k/d < n/2 have true complexity 1, in the language from [3]. These things are described 
in Section 4.1. The single scale heuristics provided there sheds a lot of light on the 
difficulties we encounter in the multi-scale context, and we encourage the reader to go 
over that section first. 

To investigate the boundedness properties of the operator in Theorem 1.1, it will be 
convenient to work with the dualized form defined by 

A(Fi,... ,F„) := / T(Fi,... ,F„_i)(f)F„(f)df = 
1 5{e'^ + ... + ^''^)M{i)T,{^'^) . . . K{&^)dl 

We will show that 

n 

|A(Fi,...,F„)|<nil^ilU- 

1=1 

In the next section we will discretize the form and convert the problem to the bound- 
edness of a model sum operator. We will use wave packets and multi-dimensional boxes 
(called tiles) to serve as their Heisenberg boxes. The tiles are then organized into trees, 
and eventually into certain products of trees, called vector trees. Most of the argument is 
then devoted to estimating the counting function associated with these vector trees. This 
is the main new contribution of our paper, and makes the object of Section 4. 

We will assume m > 2. The case m — Q (i.e. A; = 0) is entirely classical and goes back 
to the work of Coifman and Meyer. No modulation symmetries are present in this case. 
The case m = 1 can be addressed by the argument in [8], by crudely majorizing the rank 
k/d by m. Indeed, since 1 — m < 3/2 < n/2, we could treat the operator as if it had 
rank m e Z. Alternatively, one can apply the argument from section 4.4 here. 
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2. Discretization 

Prom now on, the notation | • | will refer to the cardinality of a finite set, the length of 
an interval or the volume of a multi-dimensional interval, depending on the context. The 
side length of a cube R will be denoted with 1{R). The discretization procedure in this 
section is very similar to the one from [8]. We omit most of the details. 

We will work with the constants 

1 << Co << Ci << C2 << C3 << C4, 

whose values will not be specified explicitly, but will rather be clear from the context. 
The constant Cq will be chosen first. It will be large enough depending on V, n and d. 
Then Ci will be chosen large enough, depending on T' but also on the choice of Cq. Then 
C2, C3 and C4 are chosen in this order, sufficiently large compared to their predecessor in 
the sequence. C4 will be an integral power of 2. No upper bounds will be forced upon Q 
in terms of Ci_i, so when some Ci is selected, it can be chosen as large as desired. The 
fact that these constants will depend on T' will be reffected in the fact that the bounds 
in Theorem 2.1 below also depend on V. This dependence will be ignored. 

Let O be a finite collection of no?— dimensional cubes a; = aJi x . . . x uJn, where each 
(Ji is a d— dimensional cube. These cubes are a sparse enough subcoUection of a Whitney 
decomposing the frequency space R"*^ \ T'. They will serve to localize various pieces of 
the multiplier M. For each 1 < i < n define the projections 

Oi := {uji-.cue O}. 

These collections will satisfy the following properties: 

• (i) (separation in scale) For each cJi x . . . x cJ„ e O, there is a Z e Z such that 

IcJil = . . . = |aJ„| = (C4)' 

• (ii) (separation in distance) If cui ^ cu'^ & Oj and \cui\ — then dist(a;j,a;0 > 

• (iii) (Whitney property) For each G O we have 

10"^Codiam(cu) < dist(u;,r') < 10Codiam(a;) 

• (iv) (rank m) Any m of the n components of some a) e determine uniquely the 
remaining n — m components. 

Let V be the collection of all dyadic cubes in R'^. Let be a smooth function whose 
Fourier transform is adapted^ to the cube [—1/2,1/2]'^. For each R e T> with center 
(ci(i?), . . . ,Cd{R)) and each cJj e Oj such that = 1 we define the normalized 

wave-packet 

0B...Ax^,...,xa)- ^^^,,^<py ^^^^ ^^^^ je 



^That means supported in [—1/2, 1/2]'' and with the first few derivatives bounded by one 
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By using a standard discretization procedure like in [8], involving a decomposition for 
the multiplier M adapted to the collection O, and then a Gabor basis decomposition for 
each Fi, Theorem 1.1 will follow from the discretized version below: 

Theorem 2.1. Let Fi G L^^(R'^) with 2 < pi < oo as in (3). Let O be any finite collection 
satisfying (i)-(iv). Then for generic V 



Moreover, the implicit constant in the above inequality is independent of Fi and of the 
collection O. 

We will fix O for the remaining part of the paper. The following rank properties will 
be consequences of (i)-(iv) above (see Section 6 in [8] for details) and of the requirement 
that r' is the graph over any k of the canonical variables: 

• (v) (overlapping) If for some uj,u!' G O and for some A C {1,2,... . n} with 
l^l = m we have uji C Su/ for each i & A, then uJj C CioJ/ for each 1 < j < n 

• (vi) (two lacunary indices) If for some lD,lD' G O with the additional property 
that diam(a;) < diam(a;') and for some Ac. {1,2,... ,n} with 1^41 = m we have 
cui C Scui for each i & A, then there exist 1 < ii 7^ ^2 < such that cJj ^ SuJ/ for 
each j G {ii,i2}- 

We recall the following definition from [4] . 

Definition 2.2. A collection Q of intervals in R*^ is called a central grid if 

• (gi ) R,R' eg and RnR' implies R C R' or R' C R 

• (g2) R,R' eg and R C R' implies C2R C R' 

• (gS) R,R' eg and 2\R\ < \R'\ implies Caji?] < \R'\ 

• (gA) R,R' eg and \R\ < \R'\ < 2\R\ implies dist{R,R') > C2\R'\ 

It turns out that each sufficiently sparse collection of cubes can be turned into a central 
grid. See the considerations following Definition 1 in [4] for details. 

Lemma 2.3 (Centralization). Let ^0 be a collection of d— dimensional cubes satisfying 
the following properties 

• (1) R,R' e go and \R\ < \R'\ implies \R\ <_C3\R'\ 

• (2) R,R' e go and \R\ = \R'\ implies dist{R,R') > Cgji?]. 

Then for each R G go, there is a d— dimensional interval (not necessarily a cube) R such 
that R C R C 2R and such that the collection g := {R : R G ^0} is a central grid. 

Note that both the collection Oj and the collection 2CiOi := {2CicJj : cJj G Oj} satisfy 
the requirements in Lemma 2.3 (if for example C3 << C4 — 2Ci). By applying Lemma 
2.3 to each of these collections, we associate to each uji e Oj two intervals cUi and cUi such 
that CUi C. CUi C. ScJi, 2CicUi Q cUi C. ACicUi, and such that the collection 



n n 





Oj := {uui : CUi £ Oj} 



is a grid, while the collection 



Oj := {uJi : uUi G O^} 
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is a central grid. Note that we will need a stronger assumption on the second collection. 

For each u; = cJi x . . . x oJ^, we will use the notation a; = cji x . . . x c<j„ and cD = cJi x . . . x aJ„, 
and these two new intervals will form the collections O and O, respectively. 

From now on we will abandon the collection O and only refer to the collections O and 

6. 

Note that the sidelegths of each interval a; e O are within a factor of two from each 
other. Denote by TZ{uJi) the collection of all dyadic cubes R in R'^ such that = 1. 

Since cD is a cube, the collections TZ(ujj) arc all the same for 1 < i < n. For each 
1 < z < n, let Sj denote the collection of all Sj := Rs^ x aJ^., with cUj ranging through Oj 
and i?s. ranging through TZ{uJi). The collection S will consist of all s — Eg x Us, with 
cUg := a;^^ x . . .ujs„ ranging through O and Rg ranging through (any of the) 7l{uJsi)- 

An element s = Rg x cj^^ x . . . ujs„ G S will be referred to as a multi-tile, while its 
components Si := Rg x utgi G Sj will be referred to as tiles. The intervals Rs and ujs will 
be referred to as the spatial and frequency components of s. 

We introduce some relations of order, which are very similar to the ones in Definition 
6.1. in [8]. 

Definition 2.4. Let Sj, s- e Sj be two tiles. We write 

• Si < s' if Rsi C Rgi^ and uOs'^ C uJs^ 

• -Sj < s • if Rsi C i?^/ and ujg'^ C lj^. 

• Si <' s'i if Si < s'i but Si ^ s[ 

We note that as a consequence of the rank properties (iv)-(vi) and the grid structure 
of O and O we get the following 

• (rl) (rank m) Any m of the n frequency components of some s E S determine 
uniquely the remaining n — m components. 

• (r2) (overlapping) If for some s, s' G S and for some A C {1,2, ... ,n} with 
|y4| = m we have Si < s'^ for each i E A, then sj < s'j for each 1 < j < n 

• (r3) (two lacunary indices) If for some s,s' E S with \Is'\ < \Is\ and for some 
Ac {1,2,... , n} with 1^41 — m we have Sj < s- for each i e A, then there exist 
1 < ii 7^ ^2 < n such that Sj <' s'j for each j e {ii, ^2}. 

Properties (r2) and (r3) are all about the frequency components of multi-tiles, the 

spatial components do not play any role. 

We also record for future reference the grid properties satisfied by the multi-tiles: 

• (r4) The collection {cu^. : G Sj} is a grid^ 

• (r5) The collection {(Zj^. : Sj G Sj} is a central grid 

It is also clear (due to (iii)) that if Co is sufficiently large, then 

• (r6) For each s G S we have C^uj^ nV ^0 

For each tile Sj = x cusi, we will have a wave packet associated with it, namely 

where cDj. is the cube in Oj that generates uJs^ via the procedure described earlier. Note 
that the Fourier transform of (ps, is supported in a;^ . , while spatially, (psi is a bump function 
qTiasi-localized near the cube Rg^. 



'While we can arrange that this collection is a central grid, too, we will not need this strong assumption 
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With this notation, Theorem 2.1 can be rephrased as follows: 

Theorem 2.5. Let Fi e L^'(R'') with 2 < pi < oo as in (3). Let S be a collection of 
multi-tiles satisfying (rl)-(r6). Then for generic V 



Moreover, the implicit constant in the above inequality is independent of Fj and of the 
collection S. 

The genericity of V will imply an additional rank property (r7) , that we choose not to 
state explicitly at this point, but which will become rather clear later (for example, see 
the beginning of Section 4.2). 

3. TREES 

In order to prove Theorem 2.5 we will organize each collection Sj into smaller structures, 
called trees. 

Definition 3.1. Let Rt be a dyadic cube in H'^ and let G R'^- A collection of tiles 
T <Z Si is called an i—tree (also sometimes referred to as tree, when the index i is either 
not important or when it is clear from the context) with top {Rti^t), if Rsi ^ Rt and 
£ '^sj for each Sj G T. 

In case G for each Sj G T the tree T is called i— overlapping. If G \ 
for each Si G T, the tree will be called i—lacunary. 

We note that a tree consisting of a single tile is both lacunary and overlapping (these 
are actually the only examples of such trees). In general, a tree must not necessarily 
be overlapping or lacunary. However, each tree can be split as the disjoint union of an 
overlapping tree and a lacunary tree. 

Trees will be used to construct similar structures consisting of multi-tiles, called vector 
trees. 

Definition 3.2. Let Rr^ be a dyadic cube in R'^ and let = ('Cf i' • • • ' n) ^ R*'- 

A collection of multi-tiles T G S is called a vector tree with top {Rr^^^r^) if for each 
1 <i <n, the projection Tj := {sj : s G T} is an i—tree with top {Rrf,^,^-). 

The vector tree T is called i— overlapping, if its projection Tj is an i— overlapping tree. 
If this is the case, we call the index i overlapping. Similarly, the vector tree T is called 
i—lacunary, ifTi is an i—lacunary tree. If this is the case, we call the index i lacunary. 

Remark 3.3. Note that the rank property (r3) implies that each vector tree has at least 
two lacunary indices. 

Definition 3.4. Let P C Sj be a collection of tiles. Its i—size is defined as 



where the supremum is taken over all lacunary i— trees TCP with tops {Rt,Ct)- 



n n 



iWpi- 



seS i=l i=l 
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The definition of the size of a collection P depends on the choice function F,. We choose 
not to index the size by Fi, since will later be fixed. 

The next Lemma shows that the size is dominated by the Hardy-Littlewood maximal 
function (see again [8] for details) 

N 1/2 



M^Fix) - sup f ^ / \Fny)dy) 



R cube in 

Lemma 3.5. Let T be a lacunary i—tree with top {Rt,Ct)- Then 

1/2 



^5 a consequence, 

sizei(P) < sup inf M2Fi{x). 

We will use the following estimate for vector trees: 
Lemma 3.6. Let T be a vector tree with top {Rt,^t)- Then 

n n 
sGT i=l 1=1 

Proof Use Holder's inequality with an estimate for two lacunary indices and with an 
l°° estimate for the remaining n — 2 indices. ■ 

For a collection of trees or vector trees, we will use the notation 

N^{x) := 5^ In^ix). 

Also, we will denote with ||A^^||bmo the dyadic BMO norm of Nj^. 

We next show how to split a collection P of tiles into collections of trees with controlled 
size. First, we show how to cut the size in half. 

Lemma 3.7. Let P C Si be a collection of tiles. There is a collection T of disjoint (as 
collections of tiles) i— trees in P such that 

RtC{x: inf M2F,(f) > sizei(P)}, T e (7) 
seRT 

E I^tI < (size,(P))-lF,||2 (8) 

WN^Wbuo < (sizei(P))-2[sup inf M2Fi{x)]^ (9) 
< (size,(P))-2[sup inf M2F,(f)]2[size,(Pn|F,||^^] 1 < q < oo 



and 



sizei(P\ U T) < ^sizei(P). 
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Proof The proof is very standard, so we only say a few words about it (see for example 
Lemma 7.7. in [8] or Lemma 6.6. in [9] for details). The trees in T arc selected in 2*^ 
stages. Wc need first a definition. Let \<l <d. A lacunary i— tree with top [Rt^^t) is 
said to be a (+, /) tree if 



for each e T, where ci{lo) denotes the l^^ component of the center c{lS) of lo. Similarly, 
the tree is said to be a (— ,0 tree if the inequality is reversed in (11). It is easy to see 
that each lacunary tree T is the disjoint union of at most 2^^ lacunary subtrees having the 
same top as T, each of which is either a (+, /) tree or a (— , /) tree for some \ <l < d. 
By pigeonhohng, it follows that if we ehminate from P all such trees having size^ at least 
^^^l^sizej(P), the remaining collection of tiles will have the size < |sizei(P). 

In a typical stage of the selection^ one selects lacunary trees T with tops {Rti Ct) which 
are (say) (+, /) trees, for a fixed and which satisfy 



One always aims for the tree which has the minimal value for ((^t)z among all the trees 
that quahfy to be selected at that moment. After such a tree is selected, this tree is added 
to the collection ^i(+,/) and its tiles are eliminated from P. Note that the remaining 
tiles which satisfy ^ and C R-^ will form a tree T*"*, having the same top as 
T. Add T*"* to the collection JF2(+,/) and eliminate its tiles from P. Then the cycle 
repeats, that is one starts searching again for a (+, /) tree satisfying (12). When no such 
tree is available, the (+, V) stage of the selection process ends. One adds all the trees from 
JFi(+, /) and from ^2(+, collection T. At this point one goes to the next stage 

of the selection process. 

The crucial observation is that in each stage, the trees from the collection ^i(+,/) 
have the following property^: If T, T' e J^i (+,/), e T, s- e T' and ^JJs^ C a;^/, then 

R , n i?T = 0- 

i 

Let us briefiy see why this property holds. The fact that uJs^ C uj^i^ the separation in 
scales, (r4) and the definition of (+, /) lacunary trees implies that (Ct'); > (Ct);- This in 
turn implies that the tree T was selected earlier than T'. The separation in scales and 
the fact that C\ « C2 easily imply that £ <^s^ If Rs\ and intersected (and this 
can only mean that R^i C i^x), this together with G '^s' would imply that right after 
T was selected, s\ would have qualified to be part of the tree T'*"*, and thus it would have 
been eliminated from P before the selection of T' began. The contradiction is immediate, 
thus we conclude that Rgt, fl i?T = 0- 

What the property we just proved means, is that any two trees from ^i(+, V) interact 
very weakly, in that tiles from different trees either have disjoint frequency components 
or strongly separated spatial components (so that the associated wave packets have little 
interaction). Using standard arguments, one could argue that this together with (12) 



The order of the stages does not really matter 

'In the literature, this property is referred to as "strong disjointness" 



(Ct)/ > q(c^sJ 



(11) 




(12) 
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implies the following Bessel type inequality 

\Rt\ < (size,(P))-lF,| 

TGJPl(+,0 



We can clearly replace ^i(+, /) by J^2{+: I) in the above inequality. 

By combining the contribution from all 2^^ stages we get (8). Also, (9) will follow 
similarly, by a standard localization argument. (10) follows from (7) and (9), via an 
application of John-Nirenberg's inequality (since Pi > 2). Finally, (7) is a consequence of 
Lemma 3.5. ■ 

By iterating Lemma 3.7 we get 

Proposition 3.8. Let Pj C. Si be a collection of tiles. Then one has the following decom- 
position 

P. := U ^ 

2-'=<sizei(Pi) 

where 

sizei(Pf < 2-*^+^ 
and each Pf'^ is the (disjoint) union of a family Tf^ of trees such that 

ll^^wll. ^ "^^'i sup inf M2F,(f)]2[2'=^f'||F,||^^]V5, 1 < g < oo. 

This proposition gives us good control over the number of trees corresponding to each 
component i E {1, . . . , n}. In the next section we will combine trees from each component 
to create vector trees, and will relate their counting function to the ones of the individual 
trees. 



4. Counting trees 

Recall that m is the smallest integer greater than or equal to k/d, and that we have 
assumed that m > 2. 

Throughout this section we will assume that we have a collection of multi-tiles P C S. 
We also assume that each projection Pj consists of a family J^i of disjoint i— trees 

P. = U T- 

Our goal is to split the collection P into a family of vector trees with good point- 
wise control over the counting function Njr(x) of their tops in terms of the individual 
counting functions Njr^{x) (sec (25) and (34) below). To provide the reader with a better 
understanding of what we have to prove, we first give a single scale heuristics. 

4.1. Single scale heuristics. Assume we are in the particular case where each T in 
each family jFj consists of just one tile, of the form [0, l]'^ x uj. Thus, the collection J-'i 
will consist of a family of pairwise disjoint tiles of scale 1. Assume that for each i we 
know the cardinality The question in this case is, subject to axioms (rl)-(r6) and 
the genericity of F', to estimate the cardinahty \J^\ where consists of all the multi-tiles 
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s having the property that Sj e J^i for each i. Note first that (rl) immediately imphes 
that 

m 

with the same being true for each choice of m indices. This leads to 

n 

1^1 < ni-^^r^"- 

i=l 

This estimate is only satisfactory when m < n/2, for reasons that will become clear in 
the proof of Proposition 4.1 below, however, it will be of no use when m > n/2. The 
good news is that (13) is only sharp when k/d = m, in which case we also know that 
m < n/2. li k/d < m, the inequality can be improved, and one has to use the additional 
rank property (r7) guaranteed by the genericity of F'. 

For simplicity, let us see this in the particular case n — A, d — 2, k — 3. The addi- 
tional rank property (r7) in this case will state that for each pairwise distinct ^1,^21 "^3 ^ 
{1,2,3,4}, and for every two multi-tiles s,s' with Si = s'l, knowledge of both S2 and S3 
will uniquely determine both s and s'. See the beginning of Section 4.2 for details. Using 
this and applying Lemma 4.2 as in the next section, we get the improved inequality 

1=1 

and thus, after permuting indices, we get 

4 



\^\ < n 



3/8 



In general, one gets 



nk/d 



which is the optimal inequality. The important fact is that all exponents on the right 

hand side are < 1/2. 

As a consequence of (14) we observe 

Proposition 4.1. Assume as before that k/d < n/2. Then, for generic linear forms 
k : R<^('^-i)-fc -^Yi'^,l<i<n-lwe have 

/ T\Fi{x®k{r))F^{x)dtdx\<T\\\Fi\\oo " , 

sg[o,i]'* Jt&[o,i]din-i)-k 

whenever ||-Fi||oo — 0{1). 

Here © denotes addition modulo 1. This shows that the single scale operator relevant 
to our problem is controlled by the Fourier transform, and thus, it provides a heuristics 
for the fact that Fourier analysis will be able to address the multi-scale version of the 
problem (i.e. Theorem 2.5). In contrast to this, we mention that neither the operator in 
(5) nor that in (6) satisfy any similar bounds. 
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In the language of Gowers-Wolf from [3], Proposition 4.1 is saying that the system of 
equations associated with the hnear forms Zj has true complexity 1. 

It will become clear from the argument presented in the following sections that the 
precise conditions on T' needed in Proposition 4.1 (that is, needed to guarantee (14)) 
amount to the following two requirements. To formulate them, we use the same notation 
that we have used so far. More exactly, P' is the hnear subspace of (R'^)" defined by 

n— 1 

{{ff^\... ,ff''^) -.^2^'^ ' + h{i)] + V^""^ ■ X ^ e R[x,i\}. 

i=l 

We ask P' to be k dimensional and parametrizablc over every k canonical coordinates. 
We also ask that the following system of k R*^ valued equations in d unknowns ^^^^ e R'^ 
is compatible for each distinct ii, . . . , e {1, . . . , n} 

'GiA$^^) - G,,($^+'^) ^vj, l<j<d-l 
Giiii^^) = l<j<d,2<l<m-l 
Giji^^^) = Vj,m, l<j <k + l-dim-l). 

Proof (of Proposition 4.1) By discretizing as in the previous sections (and keeping the 
notation from there), we are reduced to proving that 

seS i=l i=l 

where S consists of multi-tiles of scale 1. Note that is roughly a Fourier coeffi- 

cient of Fj. For each 2~"'' < ||-Fj||oo let ^j_„. be the set of all tiles Sj with 

Note that since ||-Fi||2 = G{1), we have that \ J^i,ni\ ^ 2^"* (Bessel's inequality). If we use 
(14), it follows that the cardinality of the set J^{n) consisting of all multi-tiles s e S such 

that Si e J-i,ni for each i, will be 0(||j^^ 2 n ). 
The sum above is then bounded by 



2knA I d 

i=\ i=l 



We make two remarks about Proposition 4.1, and in general about the applicabihty of 
our Fourier analytic techniques. The first remark shows that some non-degeneracy is in 
general needed. Take for example 

/ / Fi{x + t,y)F2{x,y + s)Fs{x,y)dtdsdxdy 



J^,(F,)(0,y)J^y(F2)(x, 0)Fs(x,y)dxdy. 

'(x,j/)GT2 

One can check that here k = d = 2, and thus k/d < n/2. However, it can be easily 
seen that this form can not be bounded by a power of (and in general, by no reasonable 
function of) ||-Fi||oo- The explanation is that P' is degenerate. 
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The second remark will show that with the current techniques, the requirement k/d < 
n/2 can not be relaxed. We illustrate this in the case d — 1 with the single scale version 
of (5) 



Fi{x e t)F2{x e t)F3{x e 2t)F^{x)dxdt. 

'xe[o,i] Jte[o,i] 

Application of the Fourier inversion formula for each function shows that the form 
above is essentially (up to some constants) 



(nj,n2,n3,n4) j=l 
^1+^2+^3+^4 — 



Our approach relies on bounding the sum above by using the triangle inequality, by 



4 



(n^, 712,^3, 714) i=l 
n-]^ —712 + 2722=0 

Choose Fi(x) = F2(x) = F^ix) = e^^'^""^ and Fi{x) = e^^^^^^', for large N. One 
can check that when i e {1,2,3}, ~ N''^/'^ for A^/lOO < n < A^, and that 

\F\{n)\ ~ Ar-V2 < ^ < -AT/IOO. It is easy to see that the term in (15) is > 1. 

Since one can check that also ||-Fi||oo ^ N'^^"^ for each i, the expression in (15) is not 
O(ll-filloo). Let J-i refer to the collection of n with |i^i(n)| ~ N~^^'^, and let J- be the 
collection of quadruples (ni, n2, ns, 714) as in (15), such that rij e for each i. We get 
that 



1/2 



and this sharp inequality becomes inefficient for any application, due to the exponents 
being 1/2. 

A moment's reflection shows that we can get the same outcome whenever k = n/2. Of 
course, our approach fails to address the case k/d = n/2, because of the use of triangle's 
inequality in (15). It is likely that the correct approach in this case (and in general, 
whenever k/d > n/2) is hy analyzing appropriate quadratic Fourier coefficients. 

To address the multi-scale case, we will have to count vector trees, rather than just 
multi-tiles. To make the exposition more transparent, we will exemplify our approach in 
the case d — 2. The general case d > 2 will follow via a similar argument, and is briefly 
sketched afterwards. 

4.2. The case d = 2. We start this section by proving that, under the particular as- 
sumptions on F' from Theorem 1.2, the multi-tiles in S satisfy the following additional 
rank property: 

• (r7) For every distinct indices ii, . . . ,ik C {1,2,... ,n} and for every (not neces- 
sarily pairwise distinct) multi-tiles s,s',p,p' e S we have 

dist {ujs ,ujp), dist (a;^/ , a;^/ ) < Cq D^ax , 
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where 

Dmax := max{dist(a;s.^ , Us'.^),dlsi{up^^ , ujp'.^)}+ 
max{dist(ci;5. , cUp. ),2 < I < m} + max{dist(ci;/ , Up{ ) , m + 1 < I < k}+ 

+ max{diam(a;s), diam(a;s/), diam(a;p), diam(a;p/)}. (16) 

It will be important that we can control both dist(a;s, a;^) and dist(a;s', a;p/) by a value 
at most Cg times larger than Dmax- (^7) will be used twice in the future, in conjunction 
with a choice of constants such that Cq << Ci « €2- 

To see why (r7) holds, let 7^ G Clus H T', 7^/ G Clujg' H T', 7^ G Clujp fl V and 
7j,' e Clujp> n r' (by (r6)). To simplify notation, assume ii — l. Note that 

dist((7ji, (7y)i),dist((7p)i, (7p/)i), dist((7s)i, (7^)^), dist((75/),, (7p/),) < C^D^ax 

(17) 

We will use the notation ^'^^^ := {^[^\ . . . ^ TC'. Recall the functions Gi from the 

introduction. Consider now the following system of {k vector, or equivalently 2k scalar) 

hnear equations in 2k variables ^[^\ . . . , e R, and coefficients Vj e R^. 

'Gi(e«)-Gi(e(2))=^;i 

< Grnii^'^) = Vm (18) 

G^+i(e(2)) = W 
G,{i^^))=vk 

By hypothesis, the system has a unique solution for each Vi. Let now E 
consist of the first k entries of 7^ — jp and 75/ — 7^/, respectively, and let vj be the values 
corresponding to this choice in the system above. Note that for each j E {1, . . . ,k} 

\Vj\<2C^Dmax, (19) 

by (17). It will follow that the solution {^^'^\^^'^^) will be bounded in norm by the norm 
of (fl, . . . ,Vk) times a constant that only depends on the coefficients of the linear forms 
Gi, . . . ,Gk- Thus, if Co is chosen large enough compared to these coefficients, we get that 
dist(75, 7p), dist(7s/, 7p/) < C^D^nax- Now (r7) is immediate. 

There will be two distinct stages, each of which will generate some vector trees. In 
each stage, before we construct the vector trees we will have to carefully reshuffle the 
collections Pj. 

Let us describe the first stage of the construction. We will first aim at separating the 
trees in each family J^j, and to achieve this we will employ a trick first used by Fefferman 
in [2]. Fix 1 < i < n. For each I > 0, let 

P,(0 := {si e Pi : 2' < |{T G J^, : e c^s,: Rs, Q Rt}\ < 2'+'}- 
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Note that (Pi(/))/>o forms a partition of P,. Next, we organize each into i— trees 
with top tiles. More precisely, consider the collection of all tiles P*(^) Q Pj(0 which are 
maximal with respect to the order relation <. 

We observe a few things. First, the tiles in P*(/) are pairwise not comparable under 
<. Second, for each tile Si e Pi(/) there is a unique tile s* e P*(/) such that Si < s*. 
To sec the uniqueness part, assume by contradiction that Si < s* and Si < s** for some 
s* ^ s** e P*(/). Then R^* H Rs** 7^ 0, which forces a;^* n a;^- = 0. This together with 
the fact that s*, s** e Pi{l) will imply that 

|{T e : e Usi, Rs, C Rt}\ > 2 X 2', 

which forces the contradiction Sj G IJ/'>«+i 

Now, for each s* G P*(/) we form the i— tree with top {Rs*, c{^s*)) consisting of all tiles 
in Pi{l) which are < s*. We have just seen that these trees partition Pi{l) and that tiles 
in distinct trees are not comparable under <. Call the collection of these trees J^i{l). It 
is easy to see that 

N:r^^i){x)<N^X^), xeR'. (20) 

We now use these trees to build our first generation of vector trees. For a moment we 
will abuse notation and for two s, s' G P we will write -s < s' if Sj < s[ for each 1 < i < n. 
Note that the extension of < from tiles to the multi-tiles in P remains a relation of order. 

— * 

For each I :— (li, ... , Z„) with h, . . . , ^ denote 

:= {s G P : Si G for each l<i<n}. 

— * — * 

The selection process goes as follows. Fix I. Find a maximal (with respect to <) s G P{1), 
then construct the vector tree T(s) with top {Rs,c{us)) consisting of all s' G P{1) such 
that s' < s. Then eliminate all multi-tiles in T(s) from P(/), and restart the selection 
(with this new value for P(/)). When no such vector tree remains to be selected, the value 
of P(/) will be 0. Denote by J-'{1) the family of the vector trees selected at this stage, 
and by C those vector trees which consist of at least two multi-tiles (i.e., in 

addition to their top, they also contain a multi-tile with a scale distinct from the scale of 
the top). The vector trees in \ will be reshuffled later, so we will ignore them 

for the moment. 

We will first show how to control the counting function A^jr..(f) in terms of each Njr^(^i.y 
For each T G J-'{1) we denote with Tj the projection of T onto Sj (and this is an i— tree). 
For each x G R*^, denote by T**{l,x) the collection of all vector trees T G J-**{1) such 
that X G R^. A similar definition holds for J^i{li,x). 

Let us first make two easy observations. On the one hand, note that for each T G , 
all the tiles of Tj arc contained in a unique tree from the family J-i{li) (this follows from an 
earlier observation, and from the fact that the tiles in Tj are pairwise comparable under 
<). We will refer to this tree as the i*^ standard projection of T. On the other hand, for a 
fixed X and some T G J-'** {I, x), if for each 1 < i < n we know the i^^ standard projection 

— » — * 

of T, we will also automatically know T (this follows from the maximality involved in the 
selection of T). We will see in Proposition 4.4 that more is true, namely that knowledge 

— * 

of just m of the standard projections suffices to determine T. 
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At this point we recall the following lemma from [5]. 
Lemma 4.2. Let X and A be finite sets and let g : X ^ A be a function. Then 



|2 

\{ixi,X2) e X^ : g{xi) = g{x2)}\ > 



\X\ 



\A\ 

We will apply this lemma with X = J^**{l,x), A = J^i(/i,f) while g{f) := /i(f) is the 
first standard projection of T (for later use, we extend this definition to all /j, 1 < i < n). 
We get that 

\m,T2) e X' : A(f = A(f,)}| > 'j^"^'^):, ■ (21) 

Next, we will estimate from above the size of the set 

n := {(Ti,T2) e X^ : /i(f i) = ^(fs)}. 
In particular, we will show that the function 

k 

H-.n^llj'.ii.^x) 

defined by 

//(fi, T2) = (/2(Ti), . . . , /^(fi), /„.+i(f 2), . . . , /fc(f 2)) (22) 

is injective (recall that k/d < n/2, thus k < n, and the functions /j will make sense for 
each 1 < i < k). This fact combined with (21) will lead to the desired estimate 

N^*.^r)i^)<[JlN:f,ii,)ix)j . (23) 

Note that the sum of the exponents on the right hand side equals the rank k/2 (and this 
is the best one can do). It will be crucial that this number is < n/2. Similarly, in the case 
of general d, one can arrange things such that the sum of the exponents will be^ < n/2. 
This is explained in the next section. 

By using this, (20) and the fact that for each x e R^, 

7V^,(y(f) = Oif2'^>7V^,(f), (24) 

we find that^ for each e > 

^ 1/2+e 

\j=l 

Here and in the following 

JF** := [Jj^**{r). 
T 



^It seems likely that one can achieve an inequality where the sum of the exponents is k/d. However, 
we will content ourselves with a sum barely smaller than n/2. 
^The extra e exponent hides a logarithmic gain 
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Also, since there was nothing special about working with indices 1, . . . ,k,we can permute 
them and get similar inequalities. Combining this with the fact that k < n, we get the 
final inequality 

^^-(^) < (|n^^.(^)j > (25) 

for some S < 1/2, depending only on k and n. The precise value of 5 will not be important, 
what matters for later apphcations is the fact that it is strictly smaller than 1/2. 
Let us now prove the following: 

Proposition 4.3. The function H is injective. 

Proof Assume that 

(fi,f2),(Ui,U2) eTi 
have the same value under H. We will prove that (Ti,T2) = (Ui,U2). Let s,s',p.p' 
the top multi-tiles of the vector trees Ti,T2,Ui,U2 and let D^^ax be as in (16). Our 
hypothesis will easily imply that 

Dmax < max{diam(a;s), diam(a;5/), diam(a;p), diam(a;p/)}, (26) 
and by (r7) we get 

dist(a;s, a;^), dist(a;5/, a;^/) < Cq max{diam(a;5), diam(a;s/), diam(a;p), diam(a;p/)}. 

(27) 

We choose one of the four vector trees for which its top multi-tile has the largest scale of 
the frequency component. To fix notation, we can assume without any loss of generality 
that this vector tree is Ti. Since Ti has at least two multi-tiles with distinct scales, we 
can find t G Ti such that diam(a;t) > diam(a;s). This observation combined with (27), 
(r5) and the fact that Co,Ci << C2 implies that ujp C a)*. Since Rt C Rg and since 
Rg C Rp (this being a consequence of the fact that x ^ RgCiRp and the fact that the scale 
of Rp is larger than the scale of Rs), it follows that t < p. But we also know that t ^ s, 
and thus Ti and Ui will share all standard projections. This forces Ti = Ui. This will in 
turn imply that T2 and U2 share at least m standard projections (corresponding to the 

— * — * 

indices j G {1, m + 1, m + 2, . . . ,k}; recall that 2m —l<k). The fact that T2 = U2 will 
follow from the following proposition, which is somewhat reminiscent of the rank property 
(iv). . 

Proposition 4.4. Let T G J-'**{l,x). If we know the value of fi{T) form of the n values 

— * 

of i, then we know T. 

Proof Let f , U G T**{t,x) such that fi{f) = /^(U) for each i G {1, . . . ,m}. Follow 
exactly the same approach as in the proof of Proposition 4.3, this time applied to the 
pairs (f,f) and (f,U). ■ 

At this point the proof of (23) is complete, and the first stage of our construction ends. 

— * 

In the second stage, we will have to deal with the collections of vector trees J^*{1) :— 

— * — * — — * 

T{1) \ T**{1) each of which consists of just one multi-tile. We will denote by P(Z) the 
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collection of all these multi-tiles. The additional key property that these multi-tiles will 
obey is 

s s' e P{f) => s, s' are not comparable under < . (28) 

This is a consequence of the maximality involved in the selection of the vector trees from 
the previous stage. (28) will turn out to be crucial in proving the analog of Proposition 
4.4, see Proposition 4.5 below. 

The plan is to reshuffle P(/) into convenient vector trees. To achieve this, we will first 
split each Pj(/) into overlapping trees (interestingly, from now on, the lacunary trees will 
not play any role). For each r G Z with 2'^ < sizej(Pj(/)), let 

Pf^f) e P.iV) : 2-^ < |i?,J-V2|(F„0,J| < 2"^+^}. (29) 

Note that 

- U 

2-'-<sizei(Pi(r)) 

We next use a greedy selection algorithm as before to split each P\^\l) into a collection 
jFj(r, /) of disjoint i— overlapping trees with top tiles pairwise not comparable with respect 
to <. This implies via a classical TT* argument (see for example Corollary 7.6. in [8]) 
that 

||iV_^.(,r)(a;)||BMO<2'^[ sup inf M2F,(f)]2. (30) 

Also, (29) will imply that Nj^,^^f^ is supported in the set {x : M2Fi{x) > 2"''}. Combining 
this with (30) and John-Nirenberg's inequality we get 

II^^.Wl(^)ll*^22^[ sup mf M2F,(f)p[2^^ni^.ll^:]'^ l<^<oo. (31) 
We then apply Fefferman's trick again, as in the first stage of our construction, this 

— * 

time however with respect to J^i{r, I) and < (rather than <). 
For each ^ > 0, let 

Pt\r,q) := {s, e Pf)(Z) : 2« < |{T G J^,{r,l) : e a;,,, R,, C R^}\ < 2^+^}. 

Note that {P\'\l, q))q>o forms a partition of P^'^^(/). Next, we organize each Pf ■*(/, q) into 
i— overlapping trees with top tiles. More precisely, consider the collection Pf''*\l,q) C 
P-^''(/, q) of all tiles which are maximal with respect to the order relation <. 

It follows as before that the tiles in Pf''*\l, q) are pairwise not comparable under < and 
that for each tile Sj G Pf'\l, q) there is a unique tile s* G P^['*\l^ q) such that Si < s*. 

Now, for each s* G P'f'*\l,q) we form the i— overlapping tree with top {Rs*.,c{uJs*)) 

consisting of all tiles in P^[\l, q) which are < s*. As before, these trees partition Pf\l^ q) 
and tiles in distinct trees are not comparable under <. Call the collection of these trees 
^j(r, / , g), and note that 
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For each vectors q and r let JF(r, /, q) consist of all the multi-tiles s with Sj e 'P^['\l, q^) 
for each i. This consideration is motivated by (28), which imphes that each vector tree 
mP{f) can have only one multi-tile. In the following, we will prove that 

^HrX^li^) ^ [U ^^.ira,,.)(^)j ' ^ e (33) 

Using this, (32) and the fact that for each x e R^, Nj.,^^, r? )^^) = if 2* > N^.,^^, r)(^)) 
we find that for each e > 



l/2+e 



2 



Here and in the following, 

^(f,r):=y^(f,u-). 

q 

Again, by permuting indices and since k < n, we get 

^Hr^r)i^^^(^^^.ir.,l)i^^^ ' ^ei^'' (34) 

for some S < 1/2. Wc mention that (34) will later be used in conjunction with (31) and 
with the upper bound (24) on the Zj. 

We next prove (33). The argument follows exactly the same scheme as in the previous 

— * 

stage of the decomposition, and we sketch it briefly. We denote by X := J^{f, I, q, x) the 

— * — # — * 

collection of all vector trees T e T{f, I, q) with x e R^. For a vector tree T we will now 

— * — * 

denote by fi{T) the tree from the collection ^(n, I, qi, x) which contains the tiles Si, for 

— * 

s e T. It will follow as in the previous stage of the construction that these projections 
are well defined. Define 7i and H as before, more precisely 

k 

H -.n^ JJjF,(rj,/,gj,f). 
Proposition 4.5. The function H is injective. 

Proof The proof of the injcctivity of H follows the same lines as the one of Proposition 
4.3, with only one key difference. Assume that 

(f 1 - {s}, f 2 {«'}), (Ui - M, U2 {p'}) e n 

have the same value under H. We will prove that (s, s') — ip,p')- Let D^ax be as in (16). 
Our hypothesis will easily imply that 

Dmax ^ Co max{diam(a;5), diam(a;5/), diam(a;p), diam(a;p')}. 

This estimate is stronger than the one in (26), in that on the right hand side here we have 
the diameters of the cubes a;, rather than those of the cubes Co. This is due to the fact 
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that the standard projections now reflect positioning with respect to <, rather than <. 
By (r7) we get 

dist(a;s, u!p),dist{u!s',ujp') < Cq max{diam(a;s), diam(a;s/), diam(a;p), diam(a;p/)}. 

(35) 

We choose one of the four vector trees whose frequency component has the largest scale. 
To fix notation, we can assume without any loss of generality that this vector tree is s. 
Observe that (35) shows that a)^ and ujp must intersect, if Cq << Ci. But then (r5) 
implies that oup Q uJs- This combined with the fact that x & Rg H Rp ^ implies that 
s < p. From (28) wc immediately get that s — p. This in turn implies that s' and p' 
share at least m standard projections. This is equivalent with saying that s[ < p\ (or vice 
versa), for at least m values of i. From (r2) we get that s' < pi (or vice versa). A final 
invocation of (28) concludes that s' = p' . ■ 

4.3. The case d > 2. The case of arbitrary d follows by considerations very similar to 
the ones involved in the case d = 2. The rank properties (very much in spirit like (r7)), 
that will be needed throughout the proof will not be stated explicitly this time, but will 
rather become clear from the non- degeneracy assumptions on F' that will be stated in 
each case. We briefly sketch the details. 

Recall we are under the assumption m > 2. We can in addition assume that m > n/2 
(and thus k/d < m). This is because if m < n/2, then, as explained earlier, one can 
crudely treat the operator as having integral rank m, and apply the methods from [8] (or 
alternatively, the approach described in Section 4.4). 

We will need the general case of the combinatorial lemma from [5] . 

Lemma 4.6. Let X and Ai, . . . , Aj^-i be finite sets and for each 1 < i < d — 1 let 
Qi : X ^ Ai he a function. Then 

\X\ 

\{{xi, . . . ,Xd) e X'^ : gi{xi) = g'j(xj+i) for all I < i < d - l}\ > _^ 

[[i=l 

The two stages of the construction arc the same as in the case d = 2, but the choice of 
the function H involves some modifications. As in the case d = 2, H will have the same 
formula in both stages of the reshuffling process, and proving its injectivity will involve 
very similar ideas. Thus, to fix notation, we only sketch the argument corresponding to 
the first stage. Take X = JF**(/, x). Note that m < n + 1 (since m < and n > 3), so 
the sets x) are defined for each 1 < j < m + 1. We distinguish three separate cases, 

and will address each of them below. 

Case 1: n is even. The first case we describe is when n is even. It follows that m < |. 
Without loss of generality we can assume that 

max{| J^,•(Zj, x)\ : 1 < j < m} — \ J-'i{li, x)\ < mm{\J^j{lj, x)\ : m + 1 < j < n}. 

(36) 

Apply Lemma 4.6 with Aj — J^i{li, x) , gj — fi. Also, define 

n := {(fi, . . . , f,) e X'^ : gj{Tj) = g^{Tj+i), l<j<d-l}, 

j 
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where each Bj equals one of the sets Ti{li,x), and 

H{f,,...,f,) := 

(/2(f i), /3(f i), . . . , /^(f i), /2(f 2), /3(f 2), . . . , /^(f 2), . . . , /2(f /3(f d), . . . , Ufa)) 

We briefly comment on this construction. It is one of many one can do, and while the 
non-degeneracy requirements to make a particular choice of H injective will depend on 

H, they are achieved for generic F'. For example, we chose to assign entries of the form 
/21 • • • I /m to each of the trees T^, but we could also have chosen instead to alternate 
between these entries and the entries fm+i, ■ ■ , f2m-i, as we did in the case d = 2. The 
only restrictions are that the entries corresponding to each tree are pairwise distinct 

— * — » 

(otherwise redundancy occurs), and that the entries for are also distinct from fi{Tj). 

Let us assume for the moment that H is injective. Combining this with Lemma 4.6 we 
get that 

m 

\j^**{i,x)\ < {\j^,{h,x)r'Y[\j^,{i,,x)\)'/'. 

Due to our assumption (36), one can easily check that this implies 

n 

l^**(^>)l<(ni-^:^(^^'^)l)'' (37) 
for some 6 < 1/2, which as explained earlier, is the desired inequality. 

— * — » — # — * 

Let us now see why H is injective. Assume that (Ti, . . . , T^), (Ui, . . . , \Jd) E Ti. have 
the same value under H. The non-degeneracy condition that we need is that the following 
system 

Gi(e('))-Gi(e(^))=^;2 



< G^ii^'^-^y) - G^{i<^^y) = v,_, (38) 

G2{^^'^)=V^,2 



G^{i^'^)^va,m 

has at most one solution, for each choice of targets vi, . . . ,Vd-i and I'^j, 1 < i < rf, 
2 < j < m. Note that there are dm — 1 R'^ valued equations in d unknowns ^^^^ e R*^, 
and that we have {dm — l)d > dk, since m > |. Thus the system above is always 
overdetermined, and we require that the matrix associated with it has maximum rank 
dk. It is not too hard to check that this is achieved for a generic choice of F'. Actually, 
our choice of H is in such a way, that the system consisting of the first k vector valued 
equations above will generically give rise to a compatible system. 

The injcctivity of H now follows as in the case d = 2. First, there must be some tQ such 
that cither Tj„ or Uj„ contains a multi-tile with the frequency scale larger than or equal 
to the scales of all the multi-tiles from the trees Tj, Uj. As before, we get that Tj^ = Ui^. 

—* — * 

The equality Tj = Uj for the remaining indices i will follow from a " domino effect" . It 
first follows for the neighboring indices i — Iq — 1 and/or i — iq + 1, using the fact that 
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/i(f,) = /i(f,J = /i(U,J = /i(U,), /,(f ,) = /,(U,) (for J e {2,3, . . . ,m}), and using 
Proposition 4.4. The domino effect continues until all indices are covered. 

Case 2: n is odd. Since we have assumed that m > n/2, it follows that m = 
In particular, we observe that whenever ii ^ {^2, . . . ,i2m-i} with {^2, . . . ,im} pairwise 
distinct and {im+i, ■ ■ ■ ,^2m-i} pairwise distinct, the following system with n — 2m — 1 
R*^ valued equations in 2 unknowns ^■'^^ , l**^^^ e R*^ 



< G,je^^)^v^,m (39) 



will have at most one solution for a generic F'. This is one of the two non-degeneracy 
conditions that will be needed in both of the following two subcases. We will refer to the 
above system as a two-scheme. 

Subcase 2a: n is odd and d is odd. 

Without loss of generality we can assume that 

max{\J^j{lj, x)\ '■ 1 < j < m + 1} — 1^2(^2, ^)| < x)\ : m + 2 < j < n}. 

(40) 

Apply Lemma 4.6 with Aj — x) and gj — fi if j is odd, Aj — ^2(^2, x) and gj — /2 

if j is even. Also, define 

n := {(fi, ... ,f,)eX'': gj{fj) = gj{Tj+i), l<j<d-l}, 

j 

where each Bj equals one of the sets Ti{li,x), and 

H{T,,... ,Ta) := 

(/3(fi),/4(fi),... ,/^(fi),/3(T2),/4(f2),... ,/^(f2),... , /aCf,) , ,),..., /^(f,) , 

/m+l(Ti), /^+i(T3), . . . , /^+i(Trf)). 

Note that H has (m — 2)d + ^ entries. If m = 2, then the first (m — 2)d entries are 
not present. Note also that the last line above contains entries of the form fm+iCii), 
for all possible odd indices i. Here is why we can not use more than such entries. 
Why more entries would certainly reinforce the injectivity of H, the application of the 
injectivity (combined with Lemma 4.6) would be inefficient, in that it would not lead to 
(37). On the other hand, our choice for H combined with the assumption (40) is easily 
seen to guarantee (37). 

The reason we chose to assign entries /m,+i(Tj) to the odd indices i (as opposed to -say- 
the first indices) is to allow for the domino effect, as explained below. 

Note also that in the definition of gj we chose to alternate between /i and /2, to prevent 
certain redundancies from occurring. This will become clear in a moment. 
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In addition to the requirement that two-scheme (39) has at most one solution, we will 
also require that the system 

rGi(f«)-Gi(?^)) = ^i 



Gi{^^'^) = Vj,i, S<l<m,l<j<d 

< Grr^+l{i^'^)^V^,m+l (41) 



has at most one solution. Note that there are d{m — — \ R-^ valued equations and d 
unknowns in R*^. Our assumption that k/d < m will imply that the system is overdeter- 
mined, so our requirement is equivalent with saying that its matrix has maximum rank 
dk. As in the previous case, a generic choice of F' will guarantee that the first k equations 
above will give rise to a compatible system. 

Let us now see why the function H is injective. Assume that (Ti, . . . , T^^), (Ui, . . . , U^) G 
7i have the same value under H. Again, by using the system (41), we first obtain that 

— * 

Tjp = for some io- There are two type of scenarios that will sustain the domino effect. 

If io happens to be even, its neighbor(s) i will be odd, and thus we are guaranteed 
that fj{Ti) — /j(Ui) for each j e {3, . . . ,m + 1}. However, since i is a neighbor of io, 

—t — * — * — * 

it will also follow that /jo(Ti) = /jo(Tio) = /jo(Uio) = fjo(^i), where jo is either 1 or 
2, depending on whether i = io — 1 or i = io + 1. In any case, Tj and Uj will share m 
standard projections, and thus will have to coincide, by Proposition 4.4. 

The second scenario is when io is odd. In this case we can not prove by following the 

— * — * 

same procedure that = Uj, for a neighboring i. What we do instead is consider the 
two-scheme (s), one associated with indices io -|- l,io -|- 2, the other one associated with 
indices io — 1, io — 2. Each of these two-schemes is of the form (39). Indeed, 2m — 2 of the 
equations are going to come from (41), while the additional equation will be of the form 

for the first two-scheme and 

for the second two-scheme. In other words, in the case of the first scheme (with a similar 
situation for the second scheme) we know that Tj(,+i and Uj^+i share m — 1 standard 
projections, and share m — 1 standard projections, and in addition, we recall 

that /2(Tio+i) = /2(TiQ+2) and /2(Ujo+i) = /2(Uig+2)- The analysis of this two-scheme 

— * — * 

(essentially, a repeat of the argument from the case d — 2) will imply that Tj = Uj, for 
i e {io + l,io + 2}. 
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If we allow combinations of these scenarios, it is easy to see that the domino effect will 
eventually prove that Tj = Uj, for all i. 

Subcase 2a: n is odd and d is even. This is the most delicate case. We will use the 
same construction as in the previous subcase, but with d + 1 vector trees, rather than d. 
More precisely, without loss of generality we can assume that 

ma.x{\J^j{lj, : 1 < j < m + 1} = \J-'2{l2, x)\ < mm{\J^j{lj, x)\ : m + 2 < j < n}. 

Apply Lemma 4.6 (this time for d + 1 sets) with Aj = x) and gj = fi if j is odd, 

Aj = ^2(^2, x) and gj = /2 if j is even. Here j G {1, . . . , d + 1}. Also, define 

n := {(Ti, . . . , T;+i) e X'+' : ^.(f,) = ^,(f ,+0, l<j< d}, 

j 

where each Bj equals one of the sets J-'i{li, x), and 

H{Ti, ... , Td+i) := 

(/3(f 1), /4(f 1), . . . , fm{Tl), /3(T2), /4(f 2), • • • , /„.(f 2), . . . , Uf,+,), ^(f ,+1), • • • , /m(f rf+i), 

/m+l(Ti), fm+liTs), . . . , fm+l{Td+l)). 

In addition to requiring that the two-schemes (39) have at most one solution, we will also 
need that the following system (a copy of (41) with d ^ d + 1) 

G2(e^'))-G2(e>))=^2 



Gii^^''^) = 1^1,3, 3 < / < m, 1 < J < d + 1 

(42) 



— Vd+l,m+l 

has at most one solution. There arc (rf + l)(m — |) — | R'^ valued equations in d + 1 
variables from R'^. The fact that the system is overdetermined 

[{d+l)im-^)-^]d>id+l)k 

is a consequence of the fact that ^ < ^ ~ |- 

The argument will then run as in the previous subcase. We leave details to the interested 
reader. 



26 



CIPRIAN DEMETER, MALABIKA PRAMANIK, AND CHRISTOPH THIELE 



4.4. The case d — 1. As advertised earlier, a simplified version of the combinatorial 
argument in Section 4.2 can also handle the case d = 1, and more generally, the case of 
arbitrary d and m < n/2. We present the argument in this generality, and thus assume 
m < n/2, rather than d = 2. We note again that the argument here reproves the main 
Theorem in [8], in the locally case, without any appeal to induction. 

We perform the same two stage decomposition, exactly as in Section 4.2. This time 
however the function H will have a simpler form. More precisely, in the first stage one 
considers functions of the form 

m 

given by 

H{f) := (A(f),... ,/„(f)), 

with an identical construction (up to notation) for the second stage. The injectivity of H 
will follow from Proposition 4.4 in the first stage, and from (r2) in the second stage. 

The injectivity of H will in turn imply the desired estimates (25) and (34), since m < 
n/2. 

Note that Lemma 4.2 and Proposition 4.3 are no longer needed here, and as a conse- 
quence we do not need any further non- degeneracy conditions on F', other than the one 
from [8], namely that F' is parametrizable over any k canonical variables. 

5. Proof of Theorem 2.5 

Assume d = 2. The argument for d > 2 would follow with no essential modification. 
For each P C S we will use the notation 

n 

Ap(f)(Fi, . . . , := 5^ \R,\-^ J] 0.J|lR.(f). 

seP i=i 

By invoking interpolation and the dilation invariance of our operator, it suffices to prove 
that for each 2 < pj < oo with — + ...-!- — =: ^ and each IIFJL. = 1 we have 

|{f :As(Fi,...,F„)(f)>l}|<l. (43) 

For the remaining part of the argument, the functions Fj will be fixed as above, and all 
sizes will be computed with respect to them. 
Consider the exceptional set 

n 

E^\J{x:M2Fi(x)>l}, 

i=l 

and note that \E\ < 1. 

It now suffices to restrict attention in (43) to the collection (which for simplicity will 
also be denoted with S) of multi-tiles s which have the property that RgHE^ ^ 0. Lemma 
3.5 will now imply that sizej(Si) < 1. 

Apply now Proposition 3.8 to the collections Sj. We get that 

S. U 

2-*<l 
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where 

sizei(Sf < 2-'=+^ (44) 

and each S^-^^ is the (disjoint) union of a family T^''^ of trees such that 

||7V «||,<22^2^^'^/^ l<q<oo. (45) 

An immediate consequence of (45) (choose q large enough) is that 



|{f :iV^(.)(a;)>2^^'}|<2-i°'= 



By eliminating another exceptional set of measure 0(1), it thus suffices to further restrict 
attention in (43) to those x which satisfy 

Nw{x)<2'"' (46) 

for each i and each 2~'^ < 1. 

— * 

We fix some ki for each i, and denote by S{k) the collection of all multi-tiles s with 
Si E Sf''^ for each i. We follow the procedure described in Section 4.2, applied to P := S(A;) 
and J-'i := J-'^^'\ The collection S(A:) will be the union of three families of vector trees: 
jFi* from the first stage of the construction and 

^l--= U U (47) 

l:l<2'i<2*''i f:2-''i <2-'=i+l 

U U ^^(^"'0 

from the second stage of the construction. Due (24) and (46), the family can be 
ignored, since it will not contribute to Ag^^g^. 

Let t be a sufficiently large number. We plan to evaluate ||Ajr;*||t, and in doing so we 

k 

will invoke (25), (45) and Holder's inequality: 



n 



1=1 

Combining this with Lemma 3.6 and estimate (44) for the size, we get 

n 



n 



nt 

i=l 



Since 6 < 1/2, it follows that 

— * 

It now remains to evaluate the contribution coming from the vector trees in (47). Fix r, I. 
As before, by using (31), (34) and Holder's inequality we get 

i=l 
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Combining this with Lemma 3.6 and estimate (29) for the size we get 

n 

iiA.,(.,.-)ii.sn2'""^s-.). 

i=l 

Summing this first over 2"''' < 2"'^'+^, then over < /j < and finally over 2~^' < 1 we 

|{f :Au,^^(Fi,...,F„)(f)>l}|<l. 
This finishes the argument. 
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